Codeword Distinguishability in Minimum Diversity Decoding
نویسندگان
چکیده
We re-take a coding-theoretic notion which goes back to Cl. Shannon: codeword distinguishability. This notion is standard in zero-error information theory, but its bearing is definitely wider and it may help to better understand new forms of coding, as we argue below. In our approach, the underlying decoding principle is very simple and very general: one decodes by trying to minimise the diversity (in the simplest case the Hamming distance) between a codeword and the output sequence observed at the end of the noisy transmission channel. Symmetrically and equivalently, minimum-diversity decoders and codeword distinguishabilities may be replaced by maximum-similarity decoders and codeword confusabilities. The operational meaning of codeword distinguishability is made clear by a reliability criterion, which generalises the well-known criterion on minimum Hamming distances for error-correction codes. We investigate the formal properties of distinguishabilities versus diversities; these two notions are deeply related, and yet essentially different. An encoding theorem is put forward, which supports and suggests old and new code constructions. In a list of case studies, we examine channels with crossovers and erasures, or with crossovers, deletions and insertions, a channel of cryptographic interest, and the case of a few “odd distances” taken from DNA word design. ∗A. Sgarro is with the Department of Mathematics and Computer Science, University of Trieste, 34100 Trieste, Italy. E-mail: [email protected] †L. Bortolussi is with the Department of Mathematics and Computer Science, University of Udine, 33100 Udine, Italy. E-mail: [email protected] ‡Partly supported by GNCS, INdAM.
منابع مشابه
ar X iv : c s / 06 06 05 1 v 1 [ cs . I T ] 1 2 Ju n 20 06 Minimum Pseudo - Weight and Minimum Pseudo - Codewords of LDPC Codes ∗
In this correspondence, we study the minimum pseudo-weight and minimum pseudo-codewords of low-density parity-check (LDPC) codes under linear programming (LP) decoding. First, we show that the lower bound of Kelly, Sridhara, Xu and Rosenthal on the pseudo-weight of a pseudo-codeword of an LDPC code with girth greater than 4 is tight if and only if this pseudo-codeword is a real multiple of a co...
متن کاملFast Reliability-based Algorithm of Finding Minimum-weight Codewords for LDPC Codes
Despite the NP hardness of acquiring minimum distance dm for linear codes theoretically, in this paper we propose one experimental method of finding minimum-weight codewords, the weight of which is equal to dm for LDPC codes. One existing syndrome decoding method, called serial belief propagation (BP) with ordered statistic decoding (OSD), is adapted to serve our purpose. We hold the conjecture...
متن کاملSpearman Permutation Distances and Shannon's Distinguishability
Spearman distance is a permutation distance which might prove useful for coding, e.g. for codes on permutations. It turns out, however, that the Spearman geometry of strings is rather unruly from the point of view of error correction and error detection, and special care has to be taken to discriminate between the two notions of codeword distance and codeword distinguishability. All of this str...
متن کاملLecture 18
Turbo codes, introduced by Berrou, Glavieux and Thitimajshima in 1993 [1], are widely used codes that exhibit excellent empirical results, i.e., the decoding algorithm produces a codeword very quickly. While turbo codes are very popular, theoretical analysis of their performance has been unsupportive. Codewords have extremely small minimum distance, and no one has been able to prove that the de...
متن کاملGroup-based joint coding and embedding technique for multimedia fingerprinting
This paper proposes a group-based fingerprinting scheme employing a joint coding and embedding strategy to trace multimedia distribution and proactively prevent the leak of multimedia information. Taking advantage of the prior knowledge on the collusion pattern, we construct compact fingerprints that consist of user sub-codeword and group sub-codeword and are embedded in host signal via spread ...
متن کامل